Method and apparatus for delivery of metadata synchronized to multimedia contents

ABSTRACT

A metadata transmitter synchronized with multimedia contents comprises: a multimedia contents authoring unit for generating and editing multimedia contents; a multimedia contents format converter for compressing the multimedia contents, converting them into a transmission format, and outputting them; a metadata authoring unit for generating and editing metadata for describing the multimedia contents, the metadata including transmission types and transmission information; a metadata format converter for converting the metadata into binary codes, converting the converted metadata into a synchronization format for synchronization with the multimedia contents and a transmission format, and outputting them; and a multiplexer for multiplexing the multimedia contents format and the metadata format respectively output from the multimedia contents format converter and the metadata format converter into a stream, and outputting it.

CROSS REFERENCE TO RELATED APPLICATION

This application is based on Korea Patent Application No. 2001-4341 filed on Jan. 30, 2001 in the Korean Intellectual Property Office, the content of which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

(a) Field of the Invention

The present invention relates to an apparatus and method for synchronizing metadata with multimedia contents, and transmitting them.

(b) Description of the Related Art

Metadata description methods for representing Essence, which is multimedia contents, and their standardization activities are now in progress. However, prior art only disclose metadata description methods and do not include synchronization and transmission methods of the multimedia contents and related metadata. The specifications of the metadata description method are found from MPEG, SMPTE, and TV.Anytime.

SUMMARY OF THE INVENTION

It is an object of the present invention to provide a method for synchronizing metadata with multimedia contents, and transmitting them, and for a terminal to receive the multimedia contents including the metadata and use them.

In one aspect of the present invention, a metadata transmitter synchronized with multimedia contents comprises: a multimedia contents authoring unit for generating and editing multimedia contents; a multimedia contents format converter for compressing the multimedia contents, converting them into a transmission format for synchronization and transmission, and outputting them; a metadata authoring unit for generating and editing metadata for describing the multimedia contents, the metadata including transmission types and transmission information; a metadata format converter for converting the metadata into binary codes, converting the converted metadata into a synchronization format for synchronization with the multimedia contents and a transmission format for transmission, and outputting them; and a multiplexer for multiplexing the multimedia contents format and the metadata format respectively output from the multimedia contents format converter and the metadata format converter into a stream, and outputting it.

The metadata format converter comprises: a metadata synchronization format converter for converting the metadata transmitted from the metadata authoring unit into binary codes, and converting them into a synchronization format for synchronization with the multimedia contents; and a metadata transmission format converter for converting the data output from the metadata synchronization format converter, according to a transmission format.

In another aspect of the present invention, a method for synchronizing metadata with multimedia contents and transmitting them comprises: (a) generating and editing metadata for describing multimedia contents, the metadata including transmission types and transmission information; (b) converting the metadata into binary codes, and converting the converted metadata into a synchronization format for synchronization with the multimedia data; and (c) converting the metadata converted in (b) into a transmission format for transmission.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate an embodiment of the invention, and, together with the description, serve to explain the principles of the invention:

FIG. 1 shows a metadata transmission system according to a preferred embodiment of the present invention;

FIG. 2 shows a metadata format converter according to a preferred embodiment of the present invention;

FIG. 3 shows a flowchart of a method for transmitting metadata synchronized with MPEG-2 data according to a preferred embodiment of the present invention;

FIG. 4 shows definitions of stream identifiers used for transmitting the metadata synchronized with MPEG-2 data according to a preferred embodiment of the present invention;

FIG. 5 shows definitions of stream-type values used for transmitting the metadata synchronized with MPEG-2 data according to a preferred embodiment of the present invention;

FIG. 6 shows an exemplified PES packet for synchronizing synchronous metadata with MPEG-2 data according to a preferred embodiment of the present invention; and

FIG. 7 shows an exemplified PES packet for synchronizing synchronized metadata with MPEG-2 data according to a preferred embodiment of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

In the following detailed description, only the preferred embodiment of the invention has been shown and described, simply by way of illustration of the best mode contemplated by the inventor(s) of carrying out the invention. As will be realized, the invention is capable of modification in various obvious respects, all without departing from the invention. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not restrictive.

FIG. 1 shows a metadata transmission system according to a preferred embodiment of the present invention. As shown, the metadata transmission system comprises a multimedia contents authoring unit 100; a multimedia contents format converter 200; a metadata authoring unit 300; a metadata format converter 400; and a multiplexer 500.

The multimedia contents authoring unit 100 generates multimedia contents, edits them, and outputs them to the multimedia contents format converter 200. In this instance, the multimedia authoring process includes processes of generating and editing the multimedia data, and the editing process does not specify a particular process excepting auxiliary tasks including correcting and adding generated multimedia data.

The multimedia contents format converter 200 compresses the multimedia contents input from the multimedia contents authoring unit 100, converts them into transmission format data for synchronization and transmission, and outputs them to the multiplexer 500. The multimedia contents format converter 200 performs synchronization format conversion and transmission format conversion. According to the embodiment of the present invention, the synchronization format includes: MPEG-2 PES (packetized elementary stream) packets, MPEG-4 SL (sync layer) packets, MPEG-4 FlexMux packets, and RTP (real time protocol) standard specifications, and the transmission format includes: MPEG-2 TS (transport stream), IP (Internet protocol), and ATM (asynchronous transfer mode) standard specifications.

That is, the multimedia contents format converter 200 compresses the multimedia contents using at least one method of standard specifications of MPEG-1, MPEG-2, MPEG-4, H.261, H.263, and H.26L. In other words, for example, it compresses some of the multimedia data using the MPEG-4 standard and compresses a remaining portion of the multimedia data using the H.263 standard, so the whole of the multimedia data may comprise the MPEG-4 data and the H.263 data.

After the compression process, the multimedia contents format converter 200 converts the compressed multimedia contents into a synchronization format using at least one standard specification of the MPEG-2 PES packet, the MPEG-4 SL packet, the MPEG-4 FlexMux packet, and the RTP packet, and converts them into a transmission format using at least one standard specification of the MPEG-2 TS, the IP, and the ATM.

The metadata authoring unit 300 generates and edits metadata for describing the multimedia contents, and outputs them to the metadata format converter 400. According to the embodiment of the present invention, the metadata authoring unit 300 performs an authoring process using one of MPEG-7, SMPTE (Society of Motion Picture and Television Engineers), TV Anytime, and EBU (European broadcasting union) standard specifications on the XML (extensible markup language). In this instance, the metadata authoring unit 300 concurrently generates transmission types and transmission information at the time of authoring.

The metadata format converter 400 converts the metadata input from the multimedia contents authoring unit 100 into binary codes, converts them into a transmission format for synchronization and transmission, and outputs them to the multiplexer 500. The metadata format converter 400 performs synchronization format conversion and transmission format conversion. The synchronization format includes data characteristics, relations with whole streams, time information, and length information of a charged load, and the transmission format representing a format needed for transmitting packetized data includes sequence information and data types of the charged load.

That is, the metadata format converter 400 converts the metadata into binary codes using at least one of the MPEG-7, the SMPTE, the TV-Anytime, and the EBU standard specifications, converts the converted metadata into a synchronization format using at least one of the MPEG-2 PES packet, the MPEG-4 SL packet, the MPEG-4 FlexMux packet, and the RTP packet standard specifications, and converts them into a transmission format using at least one of the MPEG-2 TS, the IP, and the ATM standard specifications.

The multiplexer 500 multiplexes the multimedia contents input from the multimedia contents format converter 200 and the metadata input from the metadata format converter 400 into a single stream, and transmits it to a transmission network 600.

An interactive terminal 700 processes the stream transmitted via the transmission network 600 so that a user may use desired multimedia contents and metadata.

FIG. 2 shows the metadata format converter 400 according to a preferred embodiment of the present invention. As shown, the metadata format converter 400 comprises: a metadata synchronization format converter 420; and a metadata transmission format converter 440.

The metadata synchronization format converter 420 converts the XML-language metadata transmitted from the metadata authoring unit 300 into binary codes, and converts them into a synchronization format. The metadata transmission format converter 440 converts the data transmitted from the metadata synchronization format converter 420 into predetermined data according to respective transmission formats, and outputs them to the multiplexer 500.

In this instance, the subsequent two methods can be used to synchronize the metadata according to the preferred embodiment of the present invention.

The first method is to packetize the metadata into packets identical with those for transmitting speech and image data. In detail, the metadata are packetized in the sequential order of the RTP packet and the IP packet in the Internet network case, they are packetized into TS packets after PES or section packetizing in the broadcasting network case, and they are sequentially packetized in the order of the SL packet and the FlexMux packet in the MPEG-4 case. In this instance, since the RTP packet, the PES packet, or the SL packet for packetizing the metadata has decoding time or output time value, it is required to packetize the metadata according to the time value. However, the first method is required to support each network's decoder model according to categories of transmitting networks. That is, since it is needed for the decoder to analyze the packets that have respective networks' time values and to connect to a decoder for decoding the metadata, it is impossible to amend to each system decoder model.

The second method is to convert the metadata into a synchronization format and synchronize it with multimedia data. This method enables to synchronize data and transmit them with no relation to the transmission networks. In this instance, it is necessary for the decoder model is to use the decoder model of the metadata without using that of each transmission network. Also, since the metadata synchronization format has independent decoding time and output time values, it enables to operate the decoder model and support synchronization. In this instance, the decoding time value and the output time value refer to the metadata's time default value and time reference value to represent the metadata's decoding time and output time.

The metadata synchronization format converter 420 comprises: a metadata binary code converter 421; a metadata synchronous format converter 422; a packetizing controller 423; an RTP packetizer 424; an MPEG-2 packetizing controller 425; a PSI (program specific information) generator 426; a PES packetizer 427; a section packetizer 428; and an SL packetizer 429.

The metadata binary code converter 421 converts the metadata stored in the XML language into binary codes so as to transmit the metadata generated from the metadata authoring unit 300. The metadata synchronous format converter 422 converts the binary codes into a metadata synchronization format so as to synchronize and transmit them with no relation to the transmission networks. In this instance, the metadata synchronization format independently has decoding time and output time values so as to operate the decoder model and support synchronization. Also, the decoding time value and the output time value refer to the metadata's time default value and time reference value to represent the metadata's decoding time and output time.

The packetizing controller 423 selects a metadata's transmission network so as to make the transmission network of the multimedia contents coincide with that of the metadata.

The RTP packetizer 424 packetizes the metadata into an RTP, and the SL packetizer 429 packetizes synchronous, synchronized, and asynchronous metadata into an MPEG-4 SL packet.

In this instance, the technical terms “synchronous” and “synchronize” are generally used in data broadcasting. To synchronize is to match an image frame with an audio frame at a time axis so as to regulate syncs of images and speech, or to match an axis of additional data with a time axis that matches the speech with the images. To be synchronous is to match additional data with another independently-operating time axis that does not correspond to the time axis for synchronizing the speech or images.

The MPEG-2 packetizing controller 425 classifies the metadata input to be packetized into an MPEG-2 system specification, as the metadata that have synchronization time values and other metadata that do not have them, outputs the metadata that have synchronization time values to the PES packetizer 427, outputs the metadata that do not have synchronization time values to the section packetizer 428, and transmits PSI information including metadata transmission types and transmission information to the PSI generator 426.

In this instance, the PSI for representing information defined for a decoder to decode programs includes: a PAT (program association table); a PMT (program map table); an NIT (network information table); and a CAT (conditional access table). The PAT and the PMT represent information on program elements that form a program, the NIT shows information on the transmission networks, the CAT indicates information on conditional receiving, and the PES represents a data structure used for carrying elementary streams.

Since the metadata are transmitted in the PES or sections, the PSI generator 426 receives a transmission type and transmission information and records them on the PMT section so as to provide related terminal information.

The PES packetizer 427 packetizes the synchronous and synchronized metadata into an MPEG-2 PES. Since a PES packet header includes a DTS (decoding time stamp) and a PTS (presentation time stamp), synchronization is possible based on them.

The section packetizer 428 packetizes asynchronous metadata into sections. Since a section header does not include synchronous and synchronized time values, it is used for transmitting asynchronous metadata.

The metadata transmission format converter 440 comprises: an IP packetizer 441; a TS packetizer 442; and a FlexMux packetizer 443. The IP packetizer 441 packetizes the metadata into an IP, and the FlexMux packetizer 443 packetizes the metadata into a FlexMux.

In this instance, the FlexMux represents a multiplexing method of options provided by the MPEG-4 system. That is, the FlexMux packet is used for reducing an overhead of a transmission multiplexer (TranMux) or allocating a channel of the transmission multiplexer when multiplexing a plurality of streams. In general, the MPEG-4 stream is to be packetized into an SL packet in a sync layer, but the overhead can be reduced by packetizing one or a plurality of SL packets into a single FlexMux packet. Also, each MPEG-4 stream together with a logical channel is to be transmitted to a terminal from a server, and the FlexMux packet allocates logical channels for the respective MPEG-4 streams.

The TS packetizer 442 packetizes a PMT table input from the PSI generator 426, metadata input from the PES packetizer 427, and metadata input from the section packetizer 428 into transport streams (TS).

A method for using MPEG-2 data as multimedia contents, synchronizing the metadata with the multimedia contents, and transmitting them in a digital broadcasting will now be described.

FIG. 3 shows a flowchart of a method for synchronizing the metadata with the MPEG-2 data and transmitting them according to a preferred embodiment of the present invention.

When metadata are input from the metadata authoring unit 300 and the metadata binary code converter 421 in step S9, the input metadata are analyzed in step S10. In this instance, it is determined whether they need to be synchronized with the MPEG-2 data in step S11, and when needed, they are packetized into PES packets in step S12, and when not needed, they are packetized into private sections in step S13. Also, the metadata are analyzed to generate PSI in step S14, and the generated PSI, the PES, or the data packetized into private sections are packetized into TS packets in step S15. The TS-packetized metadata are multiplexed with MPEG-2 audio/video TS through an input of a synchronization initial value to be output as a single TS in step S16. In a detailed method for synchronizing the metadata with the MPEG-2 data, a metadata time default value and a metadata time reference value are defined and used so as to synchronize the metadata with a system time reference value, that is, an STC (system time clock), and a program time reference value, that is, a PCR (program clock reference) defined by the MPEG-2 system standard.

Since the STC defined by the MPEG-2 system standard is an STC operating at 27 MHz, the STC is to be cooperated with the metadata time default value as a basic condition for synchronizing the metadata with the MPEG-2 data, which is expressed in Equation 1.

f _(STC)(t)/f _(MetadataTimeBase)(t)=+Integer  Equation 1

where f_(STC)(t) represents a system clock signal of 27 MHz, and f_(MetadataTimeBase)(t) indicates a metadata time default value.

Further, since the PCR defined by the MPEG-2 system standard is a PCR sampled by 90 KHz, the metadata time reference value is divided by the integer of 90 KHz so as to synchronize the metadata with the PCR, which is expressed in Equation 2.

(f _(STC)(t)/300)f _(MetadataClockReference)=+Integer  Equation 2

where (f_(STC)(t)/300) represents 90 KHz, and f_(MetadataClockReference) indicates a metadata time reference value.

For further detailed description of Equations 1 and 2, in the MPEG-2 system, the system clock signals are to be timed at 27 MHz so as to match the operation of the encoder and the decoder. For this purpose, the operation of the encoder at 27 MHz is to be provided to the decoder, which is enabled by transmitting the PCR that are values sampled at 90 KHz. The reason for transmitting the values sampled at 27 MHz/300=90 KHz is to maintain the compatibility between the MPEG-1 and the MPEG-2, since the MPEG-1 operates at 90 KHz. In this instance, Equation 1 represents that since the system clock of multimedia data operates at 27 MHz, the clock of the metadata is to operate at a clock signal divided by an integer corresponding to this, thereby enabling synchronization between them. In the like manner, Equation 2 shows that a metadata time reference value is to have a time reference value, with respect to the multimedia data transmitting a time reference value sampled by 90 KHz, as many as the number obtained by dividing 90 KHz by an integer so as thus to enable synchronization between them.

In the preferred embodiment of the present invention, in order to synchronize the metadata that require synchronization with the MPEG-2 data and transmit them, the metadata are packetized into access units using the MPEG-2 system standard. That is, to synchronize the metadata with the MPEG-2 data, the metadata are packetized into packets using the PES packets as defined by the MPEG-2 system standard, and they are transmitted using the TS. In order to packetize the metadata into PES packets through a detailed implementation method for synchronizing the metadata with the MPEG-2 data, a stream identifier (stream_id) of a PES packet header defined by the MPEG-2 system standard is extended as follows.

The stream identifier (stream_id) of the PES packet for transmitting the metadata is a field that represents what category of data the charged load of the PES packet is. Stream identifier values for the metadata are not defined in the current international standard, but the present embodiment defines a stream_id for the metadata and uses it, and accordingly, the metadata may be carried on the charged load of the PES packet to be transmitted, which can be expressed as follows.

PES_packet( ) { Packet_start_code_prefix Stream_id = Metadata stream PES_packet_length ... ... }

In this instance, a value 0×FC is allocated as a stream identifier for a newly defined metadata stream as shown in FIG. 4.

Also, in the preferred embodiment of the present invention, in order to transmit the metadata that do not require synchronization, the metadata are packetized using the MPEG-2 system standard. In order to transmit the metadata without synchronizing them, the metadata are packetized using the private sections, and they are transmitted using the TS as defined by the MPEG-2 system standard.

In the preferred embodiment of the present invention, in order to transmit the metadata and apply them to a terminal, a message that the metadata are transmitted is reported to the terminal by using the MPEG-2 system standard. That is, in order to report a metadata transmission notice to the terminal using the MPEG-2 system, a stream type of a PMT table header defined by the MPEG-2 system standard is extended as follows.

TS_program_map_section ( ) { table_id section_syntax_indicator ’0’ ... // Video stream_type = 0x03 (ISO/IEC 13818-2 Video) reserved elementary_PID // Audic stream_type = 0x04 (ISO/IEC 13818-3 Audio) reserved elementary_PID ... ... // Metadata stream_type = 0x15 (Metadata stream carried in PES packets) reserved elementary_PID stream_type = 0x16 (Metadata stream carried in Private Section) reserved elementary_PID ... } CRC_32 }

As described above, the PMT represents information on the element bit streams configuring a program, defines identifiers of respective element bit streams, and adds descriptors to show information on detailed element bit streams. However, since the current standard does not have stream_type values for the metadata in the PMT table in a similar manner as definition of stream_id, a stream_type value is to be set so as to configure the metadata into data related to a single program. Hence, the present invention defines and uses the stream_type value to synchronize the multimedia data with the metadata and transmit them.

As shown in FIG. 5, the stream-type values for the newly defined metadata stream have a value of 0×15 in the case of the metadata transmitted to the PES packet, and a value of 0×16 in the case of the metadata transmitted to the private section.

Finally, in the preferred embodiment of the present invention, in order to synchronize the metadata that require synchronization with the MPEG-2 data and transmit them, a CTS (composition time stamp)/DTS (decoding time stamp) time value of a metadata access unit is used as an input of a PTS (presentation time stamp)/DTS time value when packetizing the metadata into PES packets.

The metadata for being synchronized with the MPEG-2 data are classified in two ways. The first is as synchronous metadata, and the second is as synchronized metadata. Since the synchronous metadata stream is organically operated, the synchronous metadata can be synchronized with the multimedia contents by adding a synchronization initial value (Offset) to each CTS time value of the metadata stream to generate a PTS value, which is expressed in Equation 3.

PTS(t)=CTS(t)+“Offset”  Equation 3

FIG. 6 shows a PES packet format for synchronizing the metadata with the MPEG-2 data.

Since the synchronized metadata is not organically operated, the synchronized metadata can be synchronized with the MPEG-2 data by inputting each CTS time value of the metadata stream through a value identical with that of a PTS time value, which can be expressed as in Equation 4.

PTS(t)=CTS(t)

FIG. 7 shows a PES packet format for synchronizing the synchronized metadata with the MPEG-2 data. Through the above process, the synchronous and synchronized metadata can be synchronized with the multimedia contents, and they are packetized into 188-byte TS packets and multiplexed with input MPEG-2 audio/video TS so as to transmit them.

According to the present invention, a detailed implementation method for synchronizing the metadata used as additional information in the digital broadcasting with the MPEG-2 data and transmitting them is provided, thereby enabling transmitting the metadata in real-time, enabling the user's random access, and applying the two kinds of data in various ways.

While this invention has been described in connection with what is presently considered to be the most practical and preferred embodiment, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims. 

1. A method for transmitting metadata synchronized with MPEG-2 multimedia content, the method comprising: (a) converting the metadata into binary codes; (b) converting the binary codes into a synchronization format suitable for synchronization with the MPEG-2 multimedia content; and (c) packetizing the synchronization format into a metadata MPEG-2 packetized elementary stream (PES) packet, wherein the metadata MPEG-2 PES packet includes a metadata access unit (AU) and a stream ID for representing the metadata.
 2. The method of claim 1, wherein the metadata MPEG-2 PES packet further includes a presentation time stamp (PTS) and a decoding time stamp (DTS).
 3. The method of claim 2, wherein the PTS is established referring to an output time of the metadata AU.
 4. The method of claim 3, wherein the PTS is made by adding a synchronization initial value (Offset) to the output time of the metadata AU.
 5. The method of claim 3, wherein the PTS is identical to the output time of the metadata AU.
 6. The method of claim 1, further comprising: (d) converting the metadata MPEG-2 PES packet into a metadata MPEG-2 transport stream (TS); and (e) multiplexing the metadata MPEG-2 TS and a MPEG-2 audio/video TS into a single TS.
 7. The method of claim 6, wherein the metadata MPEG-2 PES packet further includes a presentation time stamp (PTS) and a decoding time stamp (DTS).
 8. The method of claim 7, wherein the PTS is established referring to an output time of the metadata AU.
 9. The method of claim 8, wherein the PTS is made by adding a synchronization initial value (Offset) to the output time of the metadata AU.
 10. The method of claim 8, wherein the PTS is identical to the output time of the metadata AU.
 11. A method for decoding a transport stream which is multiplexed by a MPEG-2 multimedia content and a metadata synchronized with the MPEG-2 multimedia content, the method comprising: (a) receiving the transport stream and converting it into a metadata MPEG-2 packetized elementary stream (PES) packet, wherein the metadata MPEG-2 PES packet includes a metadata access unit (AU) and a stream ID for representing the metadata, and a presentation time stamp (PTS); and (b) decoding the metadata referring to the metadata MPEG-2 PES to be synchronized with the MPEG-2 multimedia content.
 12. The method of claim 11, wherein the PTS is used to generate an output time of the metadata AU.
 13. The method of claim 12, wherein the output time of the metadata AU is generated by offsetting the PTS with a synchronization initial value (offset value).
 14. The method of claim 12, wherein the PTS is identical to the output time of the metadata AU.
 15. A MPEG-2 packetized elementary stream (PES) packet comprising: a metadata access unit (AU); a stream ID for representing a metadata; a presentation time stamp (PTS); and a decoding time stamp (DTS).
 16. The MPEG-2 PES packet of claim 15, wherein the PTS is established referring to an output time of the metadata AU.
 17. The MPEG-2 PES packet of claim 16, wherein the PTS is made by adding a synchronization initial value (Offset) to the output time of the metadata AU.
 18. The MPEG-2 PES packet of claim 17, wherein the PTS is identical to the output time of the metadata AU.
 19. A method for transmitting metadata synchronized with MPEG-2 multimedia content, the method comprising: (a) converting the metadata into a synchronization format suitable for synchronization with the MPEG-2 multimedia content, wherein the synchronization format is MPEG-2 packetized elementary stream (PES) packet including a metadata access unit (AU) and a stream ID for representing the metadata; and (b) converting the synchronization format into a transport format, wherein the transport format is MPEG-2 Transport Stream (TS) packet.
 20. The method of claim 19, wherein the PTS is established referring to an output time of the metadata AU.
 21. A method for decoding a MPEG-2 transport stream which includes a metadata synchronized with a MPEG-2 multimedia content, the method comprising: (a) converting the MPEG-2 transport stream into MPEG-2 packetized elementary stream (PES) packet, wherein the MPEG-2 PES packet includes a metadata access unit (AU) and a stream ID for representing the metadata, and a presentation time stamp (PTS); and (b) decoding the metadata based upon the PTS.
 22. The method of claim 21, wherein the PTS is used to generate an output time of the metadata AU.
 23. The method of claim 21, wherein the MPEG-2 PES packet further includes a decoding time stamp (DTS) which is used for decoding the metadata.
 24. The method of claim 11, wherein the metadata MPEG-2 PES packet further includes a decoding time stamp (DTS) which is used for decoding the metadata. 